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Description 

The. present invention relates to a recombinant HIV (Human Immunodeficiency Virus) antigen. A recombinant 
antigen derived from the molecular cloning and expression in a heterologous expression system of a synthetic DNA 
s sequence of the various HIV antigens can be used as a reagent for the detection of antibodies and antigen in body 
fluids from individuals exposed to various HIV isolates. ' 

The nucleotide sequence of the proviral genome has been determined for several HIV isolates, including HIV-1 
strains HTLV-III (Ratner et al., Nature (1985) 313:277); ARV-2 (Sanchez-Pescador et afl., Science (1985) 227:484); 
LAV (Wain-Hobson et al., Ce//(1985) 40:9); and CDC-451 (Desai et al., Proc. Natl. Acad. Sci. USA (1986) 83:8380). 
10 The nucleotide sequence of the HI V-2 ROD isolate was reported by Guyader et aL (Nature (1 987) 326:662). 

HIV antigens have been obtained from the virus grown in tissue culture, or from a molecularly cloned genomic 
fragment expressed in heterologous hosts such as Escherichia coli. The tissue culture derived virus involves the cum- 
bersome and often difficult process of growing virus infected cells in stringent sterile conditions. Further, the virus 
derived from tissue culture is infectious, and, therefore is hazardous to the health of individuals involved in propagation 
is and purification. The expression of molecularly cloned HIV genomic fragments overcomes the biohazard problem. 
Generally, an HIV genomic fragment from a single HIV isolate with mammalian codons is expressed in a heterologous 
system, such as, bacteria or yeast, and is limited to the use of available restriction sites present in the viral genome 
for cloning and expression. 

It has been difficult to obtain expression in heterologous systems of some of the HIV proteins, such as the HIV-1 
20 envelope antigen gp41 . Several researchers have tried deleting the hydrophobic regions of the HIV-1 gp41 to increase 
expression levels. UK Patent Application GB 2188639 discloses an HTLV-III gag/env gene protein wherein the env 
fragment of the DNA sequence deleted codons corresponding to the first hydrophobic region of the gp41 protein. U. 
S. Patent No. 4,753,873 discloses a peptide fragment that is encoded by a nucleotide sequence wherein the nucleotides 
coding for a first and second hydrophobic region of HTLV-III gp41 are deleted. 
25 Poor expression can be the result of many factors, including the specific nucleic acid sequence of the gene to be 

expressed, the mammalian codons of the gene sequence to be expressed may not be efficiently transcribed and trans- 
lated in a particular heterologous system, and the secondary structure of the transcribed messenger RNA. The use of 
synthetic DNA fragments can increase expression in heterologous systems. 

30 SUMMARY OF THE INVENTION 

A recombinant antigen which is derived from the molecular cloning and expression of a synthetic DNA sequence 
in heterologous hosts is provided. A synthetic DNA sequence coding for the recombinant antigen of the invention is 
further provided. The synthetic DNA sequence selected for expression of various HIV antigens is based on the amino 
35 acid sequence of either a single isolate or several isolates, optimized for expression in Escherichia coli by specific 
codon selection. The synthetic DNA sequence gives higher expression of the particular antigen encoded. This antigen 
can be substituted for viral antigens derived from tissue culture for use as diagnostic and therapeutic reagents. 

The present invention can be utilized to synthesize full length HIV transmembrane envelope gene using bacterial 
codons. Another aspect of the invention involves the linkage of sequences which are poorly expressed as individual 
40 proteins, to sequences which are expressed with high efficiency. The combination of the sequence of the entire coding 
region of a gene of one virus with coding sequences of another gene from a different virus to produce a fusion protein 
' can be achieved. The fusion proteins thus expressed have a unique advantage of antigenic epitopes of two viral anti- 
gens. 

The present invention includes a full length synthetic gene (FSG) for HIV-1 transmembrane glycoprotein (TMP). 

45 

DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates the alignment of the TMP fragment encoding amino acid residue nos. 552-668 of HIV-1 with the 
sequences of the four different isolates used to derive the amino acid sequence of BS2-10. 
50 Fig. 2 illustrates the assembly of 16 oligonucleotides to form the synthetic TMP fragment of Fig. 1 , and its cloning 

into pUCl8, designated BS2-10. 

Fig. 3 illustrates the DNA and amino acid sequence of FSG, indicating the restriction sites and subfragments used 
for assembly. 

Fig. 4 is a comparison of the amino acid sequence used to develop the synthetic HIV-1 envelope gene with known 
55 amino acid sequences of 1 3 independent isolates, indicating all linker-derived sequences (+) and amino acid substi- 
tutions (*). 

Fig. 5 is a schematic diagram of the assembly and cloning of the major subfragments to form FSG in pUC18. 
Fig. 6 is a schematic diagram of the cloning of FSG into lambda pL expression vectors to generate pSD301 and 
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pSD302. 

Figs. 7 illustrates the amino acid sequences of pSD30.1 and pSD302, indicating all linker-derived sequences (+) 
' and amino acid substitutions (*). 

Fig. 8 illustrates results of expression analysis of pSD301 and pSD302. A) Coomassie stained gel; B) Immunoblot 
5 using AIDS patients' sera. 

DETAILED DESCRIPTION OF THE INVENTION 

Synthetic DNA fragments of the HIV genome can be synthesized based on their corresponding amino acid se- 

10 quences. By comparing the particular region of interest between different isolates, a sequence can be selected which 
is different from any sequence that exists in nature, because the sequence is a compilation of the sequences from 
various isolates. For example, the synthetic HIV-1 envelope protein described in Example 1, is based on the amino 
acid sequence of four different HIV 1 isolates, namely, HTLV-IIIB, LAV-1 , ARV-2 and CDC-451 . 

Other properties can be built into the sequence. For example, codons can be switched for optimal expression in 

15 bacteria or yeast, specific restriction sites can be introduced, and other restriction sites can be removed. In addition, 
the sequence should have specific restriction sites at both 5' and 3' ends of the fragment to facilitate cloning in a 
particular vector. Synthetic DNA fragments can be synthesized as follows: (1) select an unique protein sequence, (2) 
reverse translate to determine complementary DNA sequence, (3) optimize codons for bacterial or yeast expression, 
and (4) introduce and/or remove specific restriction sites. 

20 . Sixty-one distinct nucleotide codons code for 20 amino acids giving rise to the degeneracy in the genetic code. 
Researchers have reported the frequencies of codons used in nucleic acids for both eukaryotic and prokaryotic organ- 
isms. (Grantham et al., Nucleic Acids Res. [1980] 9:r43; Gouy et al., Nucleic Acids Res. [1982] 10:7055; Sharp et al., 
Nucleic Acids Res. [1986] 14:7737.) Sequences from the entire E coli genome, with examples of sequences from the 
chromosome, transposons, and plasmids, have been analyzed. These sequences code for structural proteins, enzymes 

25 and regulatory proteins. Correlation has been shown between the degree of codon bias within a particular gene and 
the level of gene expression. 

It is preferred that the same codon triplet for each particular amino acid of the synthetic DNA sequence be used. 
However, alternative codon(s) can be used to add or delete a particular restriction site. The sequence should include 
unique restriction sites which can be used to delete a specific fragment or sequence, or substitute a particular sequence 

30 in case of an error in the original synthesis. 

Vector systems which can be used include plant, bacterial, yeast, insect, and mammalian expression systems. It 
is preferred that the codons are optimized for expression in the system used. The proteins and polypeptides provided 
by the invention, which are cloned and expressed in heterologous systems, as described above, can be used for 
diagnostic and therapeutic purposes. 

35 a preferred expression system utilizes the lambda pL vector system. This expression system has the following 

features: (1) a strong lambda pL promoter, (2) a strong three-frame translation terminator rmBtl, and (3) translation 
starts at an ATG codon, eight base pairs from the ribosome binding site located within an accessible Ncol restriction site. 

Another advantage of the expression system of the present invention is that one can customize the synthetic DNA 
fragments so they contain specific DNA sequences which express proteins with desired amino acid sequences, and 

40 further allows one the capability of adding, at either the 5' or 3' end, other DNA sequences to facilitate the transfer of 
synthetic fragments into various vectors. 

Additionally, the use of particular restriction sites at both ends of the fragment may also facilitate incorporation of 
the fragment into other sequences to generate fusion proteins, which can also be used as diagnostic and therapeutic 
reagents. For example, the HIV-1 gp41 sequence can be incorporated within or at the end of core/surface antigen of 

45 the hepatitis B viral sequence to generate a fusion protein which can be used in a single assay screening system for 
the detection of both AIDS and Hepatitis B in prospective blood donors. Alternatively, the assay can be used to track 
the course of a patient's infection. 

Other proteins from any source, including bacterial, yeast, insect, plant or mammalian, can be used with the syn- 
thetic DNA fragments of the invention to produce fusion proteins. Those which are expressed efficiently in their re- 

50 spective expression systems are especially preferred because they can enhance the expression of the synthetic frag- 
ment of the fusion protein. 

The synthetic DNA sequences of the present invention, derived from several HIV isolates and optimized for ex- 
pression in E. coli, provides continuous availability and uniformity of HIV antigen preparations which will recognize test 
sera from individuals exposed to genetically distinguishable variant HIV isolates. The recombinant antigen expression 
55 is very stable since E. coli codons have been used for its synthesis. Moreover, the insertion of specific restriction sites 
allows addition, deletion, or substitution in important antigenic epitopes in the coding sequences, a property difficult to 
achieve when naturally occurring HIV sequences are utilized for expression. Construction of synthetic genes also 
allows the addition of protein sequences at either amino- or carboxyl- terminus of the gene thereby allowing the de- 
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velopment of novel reagents. For example, a fusion gene can be produced comprising a fusion between HIV-1 core 
antigen and HIV-1 envelope synthetic gene. More specifically the envelope synthetic gene comprises the carboxyl- 
terminus HIV-1 gp120 sequence and the full length HIV-1 gp41 . Similarly, the HIV-1 core antigen DNA sequence can 
be fused to the HIV-2 gp41 sequences, both of which can be expressed at high levels in heterologous host systems 
s such as E. coli. 

E. coli strains containing plasmids useful for constructs of the invention have been deposited at the American Type 
Culture Collection, Rockville, Maryland, on November 22, 1988, under the accession nos. ATCC 67855 
(pSD301/RR1/pRK248.clts) and ATCC 67856 (pSD306/CAG456). 

The following examples further describe the invention. The examples are not intended to limit the invention in any 
10 manner. 

Reagents and enzymes 

Media such as Luria-Bertani (LB) and Superbroth II (Dri Form) were obtained from Gibco Laboratories Life Tech- 
'5 nologies, Inc., Madison, Wisconsin. Restriction enzymes, Klenow fragment of DNA polymerase I, T4 DNA ligase, T4 
polynucleotide kinase, nucleic acid molecular weight standards- M13 sequencing system, X-gal (5-bromo-4-chloro- 
3-indonyl-p-D-galactoside), IPTG (isopropyl-p-D-thiogalactoside), glycerol, Dithiothreitol, 4-chloro-1-napthol were pur- 
chased from Boehringer Mannheim Biochemicals, Indianapolis, Indiana; or New England Biolabs, Inc., Beverly Mas- 
sachusetts; or Bethesda Research Laboratories Life Technologies, Inc., Gaithersburg, Maryland. Prestained protein 
20 molecular weight standards, acrylamide (crystallized, electrophoretic grade >99%); N-N'-Methylene-bis-acrylamide 
(BIS); N.N.N'.N'rTetramethylethylenediamine (TEMED) and sodium dodecylsulfate (SDS) were purchased from Bio- 
Rad Laboratories, Richmond, California. Lysozyme and ampicillin were obtained from Sigma Chemical Co., St. Louis, 
Missouri. Horseradish peroxidase (HRPO) labeled secondary antibodies were obtained from Kirkegaard & Perry Lab- 
oratories, Inc., Gaithersburg, Maryland. Seaplaque agarose (low melting agarose) was purchased from FMC Bioprod- 
25 ucts, Rockland, Maine. 

T50E10 contained 50 mM Tris, pH 8:0, 10 mM EDTA; 1X TG contained 100 mM Tris, pH 7.5 and 10% glycerol; 
2X SDS/PAGE loading buffer consisted of 1 5% glycerol, 5% SDS, 1 00 mM Tris base, 1 M p-mercaptoethanol and 0.8% 
Bromophenol blue dye; TBS contained 50 mM Tris, pH 8.0, and 150 mM sodium chloride; Blocking solution consisted 
of 5% Carnation nonfat dry milk in TBS. 

30 

Host cell cultures, DNA sources and vectors 

E. coli JM1 03 cells, pUC8, pUC18, pUC19 and M13 cloning vectors were purchased from Pharmacia LKB Bio- 
technology, Inc., Piscataway, New Jersey; Competent Epicurean™ coli strains XL1-Blue and JM109 were purchased 
35 from Stratagene Cloning Systems, La Jolla, California. RR1 cells were obtained from Coli Genetic Stock Center, Yale 
University, New Haven, Connecticut; and E. co// CAG456 cells from Dr. Carol Gross, University of Wisconsin, Madison, 
Wisconsin. Vector pRK248.clts was obtained from Dr. Donald R. Helinski, University of California, San Diego, California. 



40 General methods 

All restriction enzyme digestions were performed according to suppliers' instructions. At least 5 units of enzyme 
were used per microgram of DNA, and sufficient incubation was allowed to complete digestions of DNA. Standard 
procedures were used for mini cell lysate DNA preparation, phenol-chloroform extraction, ethanol precipitation of DNA, 
45 restriction analysis of DNA on agarose, and low melting agarose gel purification of DNA fragments (Maniatis et al., 
Molecular Cloning. A Laboratory Manual[Nevj York: Cold Spring Harbor, 1982]). Plasmid isolations from E. coli strains 
used the alkali lysis procedure and cesium chloride-ethidium bromide density gradient method (Maniatis et al., supra). 
Standard buffers were used for T4 DNA ligase and T4 polynucleotide kinase (Maniatis et al., supra). 

so EXAMPLES 

Example 1 

Cloning strategy of codonoptimized synthetic HIV-1 envelope protein 

55 ' — — 

In order to develop a synthetic gene encoding the HIV-1 envelope glycoprotein and fragments thereof, the amino 
acid sequences of four independent HIV-1 viral isolates designated as HTLV-IIIB (BH102), LAV-1 (MAL), ARV-2 (SF), 
and CDC-451 (CDC42) were compared. A unique amino acid sequence from the four isolates (Fig. 1 ) was selected to 
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derive a fragment with amino acid residues nos. 552-668 (numbering by Ratner et al., supra). This fragment contained 
nine amino acid substitutions (8%) as compared to the HTLV-IIIB (BH102) isolate. This amino acid sequence was 
reverse translated using codons optimized to facilitate high level expression in E. coli. The ambiguous nucleotides 
remaining in the second and/or third base of the codon were assigned to facilitate molecular cloning, and the addition, 

s substitution, or deletion of sequences. The DNA sequence was then subdivided into eight double stranded fragments 
with unique 6 bp overhangs to direct specific annealing. The sixteen individual oligonucleotides were synthesized on 
Applied Biosystem 380A DNA synthesizer using methods and reagents recommended by the manufacturer. These 
purified oligonucleotides were annealed and ligated together to assemble the entire fragment which was digested with 
BamHI and Sail, ligated into pUC18 and transformed into E. co//JM103 cells. A clone designated BS2-10 (Fig. 2) was 

10 isolated, restriction mapped and its DNA sequence confirmed using the Sanger dideoxy chain termination method 
(Sanger et al., J. Mol. Biol (1982) 162:729). 

In order to establish that clone BS2-10 expressed this unique HI V-1 transmembrane protein (TMP) fragment, the 
BS2-10/JM1 03 culture was grown at 37°C in 50 ml Luria Broth, in a 250 ml Erlenmeyer flask. When the cultures reached 
an OD600 of 0.3-0.5, IPTG was added to a final concentration of 1 mM to induce expression. Samples (1.5 ml) were 

is removed at 1 hr intervals, and the cells were pelleted and resuspended to an OD600 of 1 0.0 in 2X SDS/PAGE loading 
buffer. Aliquots (1 5 uJ) of the prepared samples were loaded on a 1 5% SDS/PAGE gel, and the proteins were separated 
and then electrophoretically transferred to nitrocellulose for immunoblotting. The nitrocellulose sheet containing the 
transferred proteins was incubated with Blocking Solution for one hour and incubated overnight at 4° C with AIDS 
patients' sera diluted in TBS containing 5% E. coli JM1 03 lysate. The nitrocellulose sheet was washed three times in 

20 TBS, then incubated with HRPO-labeled goat anti-human IgG, diluted in TBS containing 10% fetal calf sera. The 
nitrocellulose was washed three times with TBS and the color was developed in TBS containing 2 mg/rnl 4-chloro- 
1-napthol, 0.02% hydrogen peroxide and 17% methanol. Clone BS2-10 demonstrated a strongly immunoreactive band 
with AIDS patients' sera indicating that the synthetic HI V-1 TMP fragment was expressed in E. coli. In order to assemble 
the full length HI V-1 transmembrane protein, as well as the extreme carboxyl-terminal 37 amino acids of gp120, the 

25 amino acid sequences of the four isolates described previously were compared to each other to derive a unique amino 
acid sequence for this gene. After this unique amino acid sequence was reverse translated using codons optimized 
for E. coli expression, the ambiguous nucleotides were assigned as previously described. The full length synthetic HIV- 
1 envelope gene (FSG) was divided into six additional subfragments. The complete DNA and amino acid sequence of 
FSG is shown in Fig. 3, indicating the restriction sites and subfragments used for assembly. Fig. 4 is a comparison of 

30 the amino acid sequence used to develop the synthetic HIV-1 envelope gene with known amino acid sequences of 1 3 
independent isolates reported in the Los Alamos HIV Data Bank (Meyers et al., Human Retroviruses and AIDS (1 987), 
Los Alamos National Laboratory). The Genalign program of Intelligenetics was used to align these sequences, and 
the alignment demonstrates that FSG (designated SYNGENE in Fig. 4) retains substantial overall sequence homology 
compared to other known isolates. Alignment parameters and alignment scores of the individual sequences are also 

35 shown. 

Synthesis and cloning of subfragments 

The subfragments located downstream from BS2-10, designated 413-1 through 413-4, were synthesized along 

^0 with additional sequences containing a BamHI restriction site at the 5' end and a Hindi 1 1 restriction site at the 3' end 
to facilitate molecular cloning and DNA sequence analysis of the individual subfragments. The subfragments located 
upstream of BS2-10 were also synthesized with additional sequences containing restriction sites useful for cloning and 
DNA sequence analysis. The subfragment encoding the carboxyl-terminal gp1 20 amino acid sequence, designated c- 
term gp120, contained EcoRI and BamHI restriction sites on the 5' end and Bglll and Smal restriction sites on the 3' 

<5 end. Similarly, subfragment 415 contained a Bglll site on the 5' end and Bglll and BamHI restriction sites on the 3 
end. With the exception of the c-term gp120 subfragment, in which both strands were synthesized as described for 
BS2-10, the remaining subfragments of FSG were synthesized by a method utilizing the Klenow fragment of DNA 
polymerase I. In this method, oligonucleotides comprising opposite strands of a particular subfragment, which contained 
ten complementary bases, were synthesized and annealed. The second complementary strand was then filled in by 

50 the Klenow fragment of DNA polymerase I in the presence of the four deoxynucleotides in a manner similar to that 
described by Sanger et al., supra, for DNA sequencing. The resulting double-stranded subfragment was then digested 
with the appropriate restriction enzymes and cloned into pUC vectors to confirm the DNA sequence, as previously 
described. Subfragments 413-1 through 413-4 were cloned into pUC18 using the BamHI and Hindlll restriction sites 
common to all. Subfragment c-term gp120 was cloned into pUC8 using the EcoRI and Smal restriction sites. Subfrag- 

55 nrient 415 was cloned into the plasmid containing c-term gp120 at the Bglll restriction site and screened for proper 
orientation by restriction mapping. The plasmid DNAs for all subfragments were prepared by the cesium chloride buoy- 
ant density gradient method and the individual DNA sequences were confirmed directly from the double-stranded 
template (Hattori et al., Nucl. Acid Res. (1985) 13:7813). 
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Assembly and cloning of FSG 

Subfragments located downstream from BS2-1 0 were cloned in a stepwise fashion utilizing unique internal restric- 
tion sites at the 5' end and a common Hindlll site at the 3' end. For example, subfragment 413-1 was cloned into 

5 BS2-10 at the Sail and Hindlll restriction sites to generate clone BS2-10A, into which 41 3-2 was inserted at the Hpal 
and Hindlll restriction sites to generate clone BS2-108. Similarly, subfragments 413-3 and 413-4 were added using 
unique EcoRV and SnaBI restriction sites, respectively. The two subfragments located upstream of clone BS2-10, 
having been cloned together in p(JC8, were ligated to BS2-10 as a BamHI fragment. Fig. 5 shows the cloning method 
used to assemble the synthetic HIV-1 envelope gene in pUC18. The final clone, designated FSG, was restriction 

10 mapped to confirm the proper orientation of the BamHI -BamHI fragment. 

Example 2 

Cloning and expression of FSG in lambda pL Vector Systems 

15 

Expression analysis of FSG was carried out in vector systems utilizing the strong lambda pL promoter and the 
temperature sensitive cl repressor gene (Benard et al. , Gene (1 979) 5:59). The specific vectors used in these analyses 
are derivatives of pBR322, containing a lambda pL promoter and a synthetic Shine-Dalgamo sequence, followed by 
restriction sites used for cloning various genes of interest. In addition, these vectors contain the strong three-frame 

20 translation terminator rrnBM. Vector pSDKR816 contains a Ncol restriction site which provides an ATG start codon 
optimally spaced from the start of transcription. Fig. 6 schematically presents the cloning of FSG into pSDKR816 to 
generate clone pSD301 . Briefly, FSG was digested with Hindlll and Smal, the ends were made blunt by filling in with 
the Klenow fragment of DNA polymerase I, and the 1 209 bp fragment was purified and ligated into pSDKRBI 6 at the 
Ncol site filled in with the Klenow fragment of DNA polymerase I. After transformation into E. coli RR1 cells containing 

25 the cits gene on the compatible vector pRK248, a clone with FSG in the proper orientation was isolated by restriction 
mapping and designated pSD301. The specific amino acid sequence encoded by pSD301 is presented in Fig. 7 indi- 
cating ail linker derived sequences (+) and all amino acid substitutions within the HIV-1 envelope sequences not yet 
identified in any published sequence (*). 

Additionally, FSG was cloned as a fusion to the HIV-1 gag protein (amino acid residue nos. 121-407, numbering 

30 by Ratner et al., supra) which is highly expressed under control of the lambda pL promoter in vector pKRR955. FSG 
was digested with Aval, the ends were made blunt by filling in with the Klenow fragment of DNA polymerase I, and the 
1199 bp fragment was purified and ligated into pKRR955 at the Smal restriction site to form. an HIV-1 gag/synthetic 
env fusion protein (Fig. 6). After transformation into E. coli pRK248.clts/RR1 cells, a clone containing FSG in the proper 
orientation was identified by restriction mapping and designated pSD302. The specific amino acid sequence of this 

35 fusion protein is presented in Fig. 7 indicating all linker derived sequences, HIV-1 gag sequences, and HIV-1 envelope 
sequences as previously described. 

Fifty ml cultures of pSD301 and pSD302 in E. coli pRK248.clts/RR1 cells were grown in Superbroth II media at 
30°C to an OD600 of 0.5, at which time the cultures were shifted to 42°C to inactivate the temperature sensitive cl 
repressor and thereby induce expression off the lambda pL promoter. Two samples (2.0 ml each) were removed at 1 

40 hr intervals. Sample preparation was as follows. 

The cells were pelleted, then resuspended in either 1X TG buffer or T50E10 buffer. An equal volume of 2X SDS/ 
PAGE loading buffer was added to the 1 X TG suspended cells to produce the whole lysate. The sample resuspended 
in T50E10 was sonicated eight times for 30 seconds each, at a power setting of 10 watts, using the microtip provided 
with the Vibra Cell Sonicator (Sonics and Materials, Inc., Danbury, CT). The sonicated sample was then centrifuged 

45 to remove the insoluble fraction which was resuspended in the original starting volume of T50E10. An equal volume 
of 2X SDS/PAGE loading buffer was added to both the sonicated soluble and insoluble fractions, which together with 
the whole cell lysate, were boiled for 5 min, centrifuged to remove any remaining insoluble material, and aliquots (15u.l) 
were separated on duplicate 12.5% SDS/PAGE gels. Proteins from one such gel were electrophoretically transferred 
to nitrocellulose for immunoblotting with AIDS patients' sera, as previously described. The second gel was fixed in a 

50 solution of 50% methanol, 10% acetic acid for twenty minutes at room temperature, and then stained with 0.25% 
Coomassie blue dye in a solution of 50% methanol, 10% acetic acid for 30 minutes. Destaining was carried out using 
a solution of 10% methanol, 7% acetic acid for 3-4 hr, or until a clear background was obtained. 

Fig. 8 presents the expression of pSD301 and pSD302 prior to (TO) and four hours post (T4) induction, analyzed 
by Coomassie blue staining (Fig. 8A) and immunoblotting (Fig. 8B). Samples were pKRR955 (TO whole cell lysate 

55 [lane 1], T4 whole cell lysate [lane 2]); pSD301 (TO whole cell lysate [lane 3], T4 whole cell lysate [lane 4], T4 sonicated 
soluble fraction [lane 5], and T4 sonicated insoluble fraction [lane 6]); and pSD302 (TO whole cell lysate [lane 7], T4 
whole cell lysate [lane 8], T4 sonicated soluble fraction [lane 9], and T4 sonicated insoluble fraction [lane 1 0]). Molecular 
weight standards were run in lane 11 . Arrows indicate the position of the induced proteins which are clearly visualized 
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in both the whole cell lysate and the sonicated insoluble cell fraction by Coomassie blue staining (Fig. 8A). Lane 2 
indicates that pKRR955 expressed the HIV-1 gag protein at a level greater than 25% of total cellular protein, lane 4 
' indicates that pSD301 expressed the synthetic HIV-1 envelope protein at a level of approximately 1 2% of total cellular 
protein, arid lane 8 indicates that pSD302 expressed the HIV-1 gag/synthetic env fusion protein at a level of approxi- 
s mately 5% of total cellular protein. The expression levels obtained using FSG were significantly higher than those 
obtained using the corresponding native viral DNA sequences in similar pL vector systems. All three recombinant 
proteins were highly reactive with AIDS patients' sera (Fig. 8B). This data demonstrates that the synthetic HIV-1 en- 
velope gene, including the hydrophobic region of the transmembrane protein, can be efficiently expressed in E coii, 
and the expressed proteins are highly immunoreactive. 

10 

Diagnostic utility of synthetic DNA derived HIV proteins 

The HIV specific proteins overexpressed in E. coli were purified using procedures known in the art. The proteins 
expressed at high levels were immunogenic and were recognized by antibodies produced in HIV-infected individuals 

is (see fig. 8). The HIV specific proteins derived from £ coli can be utilized in several immunoassay configurations, as 
described in CIP application U.S. Serial No. 020,282, filed February 27, 1987 by Dawson et al., which is hereby incor- 
porated. by reference. The parent application is EPO 86116854.0 (December 4, 1986). In a preferred configuration, 
HIV specific proteins were coated on solid support and incubated with test samples. The virus specific antibodies 
present in the test sample recognized and were bound to the HIV proteins on the solid support. The HIV specific 

20 antibodies were quanfitated by the use of goat anti-human immunoglobulin conjugated to HRPO. 

Biological samples which are easily tested by the methods of the present invention include human and animal 
body fluids such as whole blood, serum, plasma, urine, saliva, stools, lymphocyte or cell culture preparations and 
purified and partially purified immunoglobulins. The polypeptide described herein can be used to determine the pres- 
ence or absence of antibodies to HIV-1 antigen by assay methods known to those skilled in the art. 

25 One such assay involves: 

a) coating a solid support with the polypeptide disclosed herein; 

b) contacting the coated solid support with the biological sample to form an antibody polypeptide complex; 

c) removing unbound biological sample from the solid support; . 

30 d) contacting the complex on the solid support with a labeled immunoglobulin specific for the antibody; and 

e) detecting the label to determine the presence or absence of HIV-1 antibodies in the biological sample. 

A second assay method involves: 

35 a) coating a solid support with the polypeptide disclosed herein; 

b) contacting the coated solid support with the biological sample and the homologous polypeptides conjugated to 
a label; 

c) removing unbound biological sample and unbound labeled polypeptide; and 

d) detecting the label to determine the presence or absence of HIV-1 antibodies in the biological sample. 

40 

Solid supports which can be used in such immunoassays include wells of reaction trays, test tubes, beads, strips, 
membranes, filters, microparticles or other solid supports which are well known to those skilled in the art. Enzymatic, 
radioisotopic, fluorescent, chemiluminescent and colloidal particle labels can be used in the aforementioned assays. 
Furthermore, hapten/labeled anti-hapten systems such as a biotin/labeled ariti-biotin system can be utilized in the 

45 inventive assays. Both polyclonal and monoclonal antibodies are useful as reagents, and IgG as well as IgM class HIV 
antibodies may be used as solid support or labeled reagents. 

It is evident from the foregoing examples that one skilled in the art could clone together specific subfragments of 
the synthetic genes constructed to generate new synthetic genes that would have the same characteristics as those 
illustrated herein. For example, thec-termgp120subfragment, BS2-10and sub-fragment 41 3-1 can be cloned together 

50 to produce synthetic gene products useful as diagnostic and therapeutic reagents for AIDS. 



Claims 

55 

Claims for the following Contracting States : AT, BE, CH, DE, FR, GB, GR, IT, LI, NL, SE 

1 . A polypeptide comprising an amino acid sequence represented by the following: 
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y?5,^SS^H^ KYKVVKIEPL6IAPTKAKRRVV Q REKR ADLAVGILGALFLGFLGAAGSTMGARSL 
Ikiy9»5Pm«^H^Q NN LLRAIKDPKAQQHLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKL 
ICJIAVPWNASWSN^ 

NWSNITKWLWYIKLFIMIVGGLAGLRIVFAVLSIVNRVRQGYSPLS FQTRLPNPRGPDRPEGIDEEGGER 
DRDRSTRLVOISLALVWEDLRSLCLFSYHRLRDLLLIATRIVELLGRRGWEVLKYWWNLLQYVSQELKNS 
AVSLVNATAIAVAEGTDRVIEVVQRAYRAIRHIHRRIRQGLERILLQVHASSLESSWQFGPG. 

The polypeptide of Claim 1 produced by E. colj. 

A fusion polypeptide comprising a polypeptide as in Claim 1 , in which the polypeptide is fused to a prokaryotic or 
eukaryotic protein. 

The fusion polypeptide of Claim 3 wherein said prokaryotic or eukaryotic protein is the E. coH enzyme CKS. 
A synthetic gene comprising a DNA sequence represented by the following: 



ATGGGGGATCCCATGATGC^ 

CGCTGGGCATCGCTCCGACCAAAGCTAAACGCCGCGTTGTTCAGCGCGAAAAACGCGCAGATCTAGCTGT 

TGGTATCCTGGGTGCTCTGTTTCTGGGTTTTCTGGGTGCTGCTGGTTC^ 

ACTCTGACTG7TCAGGCTCGCCAGCTGCTGTCTGGTATCGTTCAGW 

TCAAGGATCCCAAAGCTCAGW 

"^CTGGCTGTTGAACGCTACCT^ 

ATTTGCACTACTGCCGTTCCGTG^ 

JCAGAACCAGCAGGAAAAAAACGAACAGGAACTG 
AACTGGTCTAACATAACTAA^ 

^^^?^^^niI^I5I^I5 T $ T ^I CGTTAACCGCGTTC GCCAGGGTTACTCTCCGCTGTCTTT 
JCAGACTCGCCTGCCGAACCCGCGCGGTCCGGACCGCCCGGAAGGTATCGATGAAGAAGGTGGTGAACGC 
GACCGCGACCGgCTACTCGCCTGGTAGATATCTCTCTGGCTCTG 

i^I G J'IJ^JJ A ^ A JCGCCTGCGCGACCTGCTGCTGATCGCTACTCGCATCGTTGAACTGCTGGGTCG 
CCGCGGTTGGGAAGTGCTGAAATACTGGTGGAACCTGCTGCAATACGTATCTCAGGAACTGAAAAACTCT 




The synthetic gene of Claim 5 coding for the polypeptide of Claim 1 . 

A fusion polypeptide comprising a polypeptide as in Claim 1 , in which the polypeptide is fused to a HI V gag protein. 

The fusion polypeptide of Claim 7 wherein said HIV gag protein is an HIV-1 gag protein comprising an amino acid 
sequence represented by the following: 



DTGHS5QVSQNYPIVQHIQG0MVHQAISPRTLNAWVKVVEEKAFSPEVIPMFSALSEGATPGDLNTMLNT 
VGGHOAAMOMLKETINEEAAEWDRyHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQIGwMTNNPPIPVGE 
IYKRWIILGLNKIVRMYSPTSILDIROGPKEPFRDYVDRFYKTLRAEQASQEVKNWMTETLLVONANPDC 
KTILKALGPAATLEEHHTACQGVGGPGKKARVLAEAHSQVTNTATIMHQRGNFRNQRKMVKCFiNCGKEGH 
TARNCRA 



A method for detecting antibodies to HIV antigens in an individual which comprises the steps of: 

a) obtaining a sample of a body fluid from the individual: 

b) incubating said body fluid with said polypeptide of Claim 1 ; 

c) incubating said body fluid with a labeled antibody to immunoglobulin; and 

d) detecting said label and determining therefrom the presence or absence of antibodies to HIV antigens. 
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10. A method for detecting antibodies to HIV antigens in an individual which comprises the steps of: 

a) obtaining a sample of a body fluid from the individual: 

b) incubating said body fluid with said polypeptide of Claim 1; 

c) incubating said body fluid with a labeled antigen; and 

d) detecting said label determining therefrom the presence or absence of antibodies to HIV antigens. 
Claims for the following Contracting State : ES 

1. A method for detecting antibodies to HIV antigens in an individual which comprises the steps of: 

a) obtaining a sample of a body fluid from the individual: 

b) incubating said body fluid with a polypeptide comprising an amino acid sequence represented by the fol- 
lowing: 

MGDPMMRDNWRSELYKYKVVKIEPLG1APTKAKRRVVQREKRADLAVGILGALFLGFLGAAGSTMGARSL 
TLTVQARQLLSGIVQQQNNLLRAIKDPKAQQHLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKL 
ICTTAVPWNASW5NKSLEDIWNNMTWMQWEREINNYTNLIYSLLEESQNQQEKNEQELLQLDKWVDASLW 
NWSNITKWLWYIKLFIHIVGGLAGLRIVFAVLSIVNRVRQGYSPLSFQTRLPNPRGPDRPEGIDEEGGER 
DRDRSTRLVDISLALVWEDLRSLCLFSYHRLRDLLLIATRIVELLGRRGWEVLKYWWNLLQYVSQELKNS 
AVSLVNATAIAVAEGTDRVIEVVQRAYRAIRHIHRRIRQGLERILLQVHASSLESSWQFGPG. 



c) incubating said body fluid with a labeled antibody to immunoglobulin; and 

d) detecting said label and determining therefrom the presence or absence of antibodies to HIV antigens. 

2. A method for detecting antibodies to HIV antigens in an individual which comprises the steps of: 

a) obtaining a sample of a body fluid from the individual: 

b) incubating said body fluid with a polypeptide comprising an amino acid sequence represented by the fol- 
lowing: 

MGDPMMRDNWRSELYKYKVVKIEPLG'IAPTKAKRRVVQREKRADLAVGILGALFLGFLGAAGSTMGARSL 
TLTVQARQLLSGIVQQQNNLLRAIKDPKAQQHLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKL 
ICTTAVPWNASWSNKSLEDIWNNM^ 

NWSNITKWLWYIKLFIHIVGGLAGLRIVFAVLSIVNRVRQGYSPLS FQTRLPNPRGPDRPEGIDEEGGER 
DRDRSTRLVOISLALVWEDLRSLCLFSYHRLRDLL 

AVSLVNATAIAVAEGTDRVIEVVQRAYRAIRHIHRRIRQGLERILLQVHASSLESSWQFGPG. 

c) incubating said body fluid with a labeled antigen; and 

d) detecting said label and determining therefrom the presence or absence of antibodies to HIV antigens. 

3. A method for producing a polypeptide comprising an amino acid sequence represented by the following: 

HGDPMMRDNWRSELYKYKVVKIEPLGIAPTKAKRRVVQREKRADLAVGILGALFLGFLGAAGSTMGARSL 
TLTVQARQLLSGIVQQQNNLLRAIKDPKAQQHLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKL 
ICTTAVPWNASWSNKSLEDIWNNMTWHQWEREINNYTNLIYSLLEESQNQQEKNEQELLQLDKWVDASLW 
NWSNITKWLWYIKLFIHIVGGLAGLRIVFAVLSIVNRVRQGYSPLSFQTRLPNPRGPDRPEGIDEEGGER 
DRDRSTRLVDISLALVWEDLRSLCLFSYHRLRDLLLIATRIVELLGRRGWEVLKYWWNLLQYVSQELKNS 
AVSLVNATAIAVAEGTDRVIEVVQRAYRAIRHIHRRIRQGLERILLQVHASSLESSWQFGPG. 

said method comprising the molecular cloning and expression in a heterologous host of a synthetic gene comprising 
a DNA sequence represented by the following: 
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ATGGGGGATCCCATGATGCGCGACAACTGGCGCTCTGAACTGTACAAATACAAAGTTG7TAAAATCGAAC 
CGCTGGGCATCGCTCCGACCAAAGCTAAACGCCGCGTTGTTCAGCGCGAAAAACGCGCAGATCTAGCTGT 
TGGTATCCTGGGTGCTCTGTTTCTGGGTTTTCTGGGTGCTGCTGGTTCTACTATGGGTGCTCGCTCTCTG 
ACTCTGACTGTTCAGGCTCGCCAGCTGCTGTCTGGTATCGTTCAGCAGCAGAACAACCTGCTGCGCGCTA 
TCAAGGATCCCAAAGCTCAGCAGCATCTGCTGCAACTGACTGTTTGGGGTATCAAACAACTGCAGGCTCG 
CGTTCTGGCTGTTGAACGCTACCTGAAAGACCAGCAGCTGCTGGGTATCTGGGGTTGCTCTGGTAAACTG 
ATTTGCACTACTGCCGTTCCGTGGAACGCTTCTTGGTCCAACAAATCTCTGGAAGACATCTGGAACAACA 
TGACTTGGATGCAATGGGAACGCGAAATCAACAACTACACTAACCTGATCTACTCTCTGCTGGAAGAATC 
TCAGAACCAGCAGGAAAAAAACGAACAGGAACTGCTGCAACTGGACAAATGGGTCGACGCTTCTCTGTGG 
AACTGGTCTAACATAACTAAATGGCTGTGGTACATCAAACTGTTTATCATGATCGTTGGTGGTCTGGCCG 
GCCTGCGCATCGTTTTTGCTGTTCTGTCTATCGTTAACCGCGTTCGCCAGGGTTACTCTCCGCTGTCTTT 
TCAGACTCGCCTGCCGAACCCGCGCGGTCCGGACCGCCCGGAAGGTATCGATGAAGAAGGTGGTGAACGC 
GACCGCGACCGCTCTACTCGCCTGGTAGATATCTCTCTGGCTCTGGTTTGGGAAGACCTGCGCTCTCTGT 
GCCTGTTTTCTTACCATCGCCTGCGCGACCTGCTGCTGATCGCTACTCGCATCGTTGAACTGCTGGGTCG 
CCGCGGTTGGGAAGTGCTGAAATACTGGTGGAACCTGCTGCAATACGTATCTCAGGAACTGAAAAACTCT 
GCTGTTTCTCTGGTTAATGCTACTGCTATCGCTGTTGCTGAAGGTACTGACCGCGTTATCGAAGTTGTTC 
AGCGCGCTTACCGCGCTATCCGCCATATCCATCGCCGCATCCGCCAGGGTCTGGAACGCATCCTGCTGCA 
GGTGCATGCCTCGAGTCTAGAAAGCTCATGGCAATTCGGGCCCGGGTAA 



4. The method of Claim 3 wherein the heterologous host is E. coli. 

5. The method of Claim 3 wherein said DNA sequence is fused to a second DNA sequence coding for a prokaryotic 
or eukaryotic protein whereby a fusion protein is expressed comprising said amino acid sequence fused to a 
prokaryotic or eukaryotic protein. 

6. The method of Claim 5 wherein said prokaryotic or eukaryotic protein is the E. colj enzyme CKS. 

7. The method of Claim 3 wherein said DNA sequence is fused to a second DNA sequence coding for a HIV gag 
protein whereby a fusion protein is expressed comprising said amino acid sequence fused to a HIV gag protein. 

8. The method of Claim 7 wherein said second DNA sequence codes for a HIV-1 gag protein comprising an amino 
acid sequence represented by the following: 

DTGHSSQVSQNYPIVONIQGOMVHQAISPRTLNAWVKVVEEKAFSPEVIPHFSALSEGATPOOLNTMLiNT 
VGGHOAAMOMLKETINEEAA^WDRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTHNPPIPVGE 
IYKRWIILGLNKIVRMYSPTSILDIROGPKEPFRDYVDRFYKTLRAEQASQEVKNWMTETLLyQNAMPDC 
KTILKALGPAATLEEHMTACQGVGGPGHKARVLAEAMSQVTNTATII-lMQRGNFRNQRKMVKCFMCGKEGr: 
TARNCRA 



Patentanspruche 

Patentanspruche fur folgende Vertragsstaaten : AT, BE, CH, LI, DE, FR, GB, GR, IT, NL, SE 

1. Polypeptid, das eine Aminosauresequenz umfaflt, die wie folgt dargestellt werden kann: 

MGDPMMRDNWRSELYKYKVVKIEPLGIAPTKAKRRVVQREKRADLAVGILGALFLGFLGAAGSTMGARSL 
TLTVQARQLLSGIVQQQNNLLRAIKDPKAQQHLLQLTVWGIKQLQARVLAVERY LKDQQLLGIWGCSGKL 

jRORSTRLVDISULVWEDLRSLCLFSY 

AVSLVNATAIAVAEGTORVIEVVQRAYRAIRHIHRRIRQGLERILLQVHASSLESSWQFGPG. 

2. Polypeptid nach Anspruch 1 , das von E. coli hergestellt wird. 
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3. Fusionspolypeptid, das ein Polypeptid nach Anspruch 1 umfaBt, bel dem das Polypeptid an ein prokaryontisches 
oder eukaryontischen Protein fusioniert ist. 

4. Fusionspolypeptid nach Anspruch 3, wobei das prokaryontische oder eukaryontische Protein das E. coli -Enzym 
CKSist. 

5. Synthetisches Gen, das eine DNA-Sequenz umfaBt, die wie folgt dargestellt werden kann: 

ATGGGGGATCCCATGATGCGCGACAACTGGCGCTCTGAACTGTACAAATACAAAGTTGTTAAAATCGAAC 
CGCTGGGCATCGCTCCGACCAAAGCTAAACGCCGCGTTGTTCAGCGCGAAAAACGCGCAGATCTAGCTGT 
TGGTATCCTGGGTGCTCTGTTTCTGGGTTTTCTGGGTGCTGCTGGTTCTACTATGGGTGCTCGCTCTCTG 
ACTCTGACTGTTCAGGCTCGCCAGCTGCTGTCTGGTATCGTTCAGCAGCAGAACAACCTGCTGCGCGCTA 
TCAAGGATCCCAAAGCTCAGCAGCATCTGCTGCAACTGACTGTTTGGGGTATCAAACAACTGCAGGCTCG 
CGTTCTGGCTGTTGAACGCTACCTGAAAGACCAGCAGCTGCTGGGTATCTGGGGTTGCTCTGGTAAACTG 
ATTTGCACTACTGCCGTTCCGTGGAACGCTTCTTGGTCCAACAAATCTCTGGAAGACATCTGGAACAACA 
TGACTTGGATGCAATGGGAACGCGAAATCAACAACTACACTAACCTGATCTACTCTCTGCTGGAAGAATC 
TCAGAACCAGCAGGAAAAAAACGAACAGGAACTGCTGCAACTGGACAAATGGGTCGACGCTTCTCTGTGG 
AACTGGTCTAACATAACTAAATGGCTGTGGTACATCAAACTGTTTATCATGATCGTTGGTGGTCTGGCCG 
GCCTGCGCATCGTTTTTGCTGTTCTGTCTATCGTTAACCGCGTTCGCCAGGGTTACTCTCCGCTGTCTTT 
TCAGACTCGCCTGCCGAACCCGCGCGGTCCGGACCGCCCGGAAGGTATCGATGAAGAAGGTGGTGAACGC 
GACCGCGACCGCTCTACTCGCCTGGTAGATATCTCTCTGGCTCTGGTTTGGGAAGACCTGCGCTCTCTGT 
GCCTGT7TTCTTACCATCGCCTGCGCGACCTGCTGCTGATCGCTACTCGCATCGTTGAACTGCTGGGTCG 
tCGCGGTTGGGAAGTGCTGAAATACTGGTGGAACCTGCTGCAATACGTATCTCAGGAACTGAAAAACTCT 
GCTGTTTCTCTGGTTAATGCTACTGCTATCGCTGTTGCTGAAGGTACTGACCGCGTTATCGAAGTTGTTC 
AGCGCGCTTACCGCGCTATCCGCCATATCCATCGCCGCATCCGCCAGGGTCTGGAACGCATGCTGCTGCA 
GGTGCATGCCTCGAGTCTAGAAAGCTCATGGCAATTCGGGCCCGGGTAA. 

6. Synthetisches Gen nach Anspruch 5, das fur ein Polypeptid nach Anspruch 1 codiert. 

7. Fusionspolypeptid, das ein Polypeptid nach Anspruch 1 umfaBt, bei dem das Polypeptid an ein HIV-gag-Protein 
fusioniert ist. 

8. Fusionspolypeptid nach Anspruch 7, worin das HIV-gag-Protein ein HIV-1 -gag-Protein ist, das eine Aminosaure- 
sequenz umfaBt, die durch folgendes dargestellt werden kann: 

TARNCRA 

9. Verfahren zum Nachweis von Antikorpem gegen HIV-Antigene bei einem Individuum, das folgende Schritte umfaBt: 

a) Erhalt einer Probe einer Kdrperflussigkeit des Individuums; 

b) Inkubation der Kdrperflussigkeit mit dem Polypeptid nach Anspruch 1 ; 

c) Inkubation der Korperflussigkeit mit einem markierten Antikorper gegen Immunglobulin; und 

d) Nachweis der Markierung und daraus Bestimmung der Anwesenheit oder Abwesenheit von Antikorpem 
gegen HIV-Antigene. 

10. Verfahren zum Nachweis von Antikorpem gegen HIV-Antigene bei einem Individuum, das folgende Schritte umfaBt: 

a) Erhalt einer Probe von Korperflussigkeit des Individuums; 

b) Inkubation der Korperflussigkeit mit dem Polypeptid nach Anspruch 1 ; 

c) Inkubation der Korperflussigkeit mit einem markierten Antigen; und 

d) Nachweis der Markierung, wobei dadurch die Anwesenheit oder Abwesenheit von Antikorpem gegen HIV- 
Antigene bestimmt wird. 
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Patentanspruche fur folgenden Vertragsstaat : ES 

1. Verfahrenzum Nachweis von Antikorpem gegen HIV-Antigene bei einem Individuum, dasfolgendeSchritte umfaQt: 

a) Erhalt eine Probe einer Korperflussigkeit des Individuums, 

b) Inkubation der Korperflussigkeit mit einem Polypeptid, das eine Aminosauresequenz umfaQt, die durch 
folgendes dargestellt werden kann: 

MGDPMMRDNWRSELYKYKVVKIEPLGiAPTKAKRRVVQREKRAOLAVGILGALFLGFLGAAGSTMGARSL 

TLTVQARQLLSGIVQQQNNLLRAIKDPKAQQHLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKL 
ICTTAVPWNASWSNKSLEDIWNNMTWMQWEREINNYT^ 

NWSNITKWLWYIKLFIMIVGGLAGIRIVFAVISIVNRVRQGYSPI^ 

DRDRSTRLVOISLALVWEDLRSLCLFSYHRLRDLLLIATRIVELLGRRGWEVUYWWNLLQYVSQELKNS 
AYSLVNATAIAVAEGTDRVIEVVQRAYRAIRHIHRRIRQGLERILLQVHASSLESSWQFGPG . 

c) Inkubation der Korperfussigkeit mit einem markierten Antikdrper gegen Immunglobulin; und 

d) Nachweis der Markierung und daraus Bestimmung der Anwesenheit oder Abwesenheit von Antikorpem 
gegen HIV-Antigene. 

2. Verfahren zum Nachweis von Antikorpem gegen HIV-Antigene bei einem Individuum, das tolgende Schritte umfaQt: 

a) Erhalt einer Probe von Korperflussigkeit des Individuums, 

b) Inkubation der Korperflussigkeit mit einem Polypeptid, das eine Aminosauresequenz umfaQt, die wie folgt 
dargestellt werden kann: 

.MGDPMMRDNWRSELYKYKVVKIEPLGIAPTKAKRR 

TLTVQARQLLSGIVQQQNNLLRAIKDPKAQQHLLOLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKL 
ICTTAVPWNASWSNKSLEDIWNNMTWMQWEREINNYTNLIYSLLEESQNQQEKNEQELLQLDKWVDASLW 
NWSNITKWLWYIKLFIHIVGGLAGLRIVFAVLSIVNRVRQGYSPLS FQTRLPNPRGPORPEGIDEEGGER 
DRDRSTRLVOISLALVWEDLRSLCLFSYHRLRDLLLIATRIVELLGRRGWEVLKYWWNLLQYVSQELKNS 
AVSLVNATAIAVAEGTORVIEVVQRAYRAIRHIHRRIRQGLERILLQVHASSLESSWQFGPG 

c) Inkubation der Korperflussigkeit mit einem markierten Antigen, und 

d) Nachweis der Markierung und daraus Bestimmung der Anwesenheit oder-Abwesenheit von Antikorpem 
gegen HIV-Antigene. 

3. Verfahren zur Herstellung eines Polypeptids, das eine Aminosauresequenz umfaGt, die durch folgendes dargestellt 
werden kann: 



ICTTAVPWNASWSNKSLEDIWNNM^ 
AVSLWTAIAWSTORVltvvqWY^ 

wobei das Verfahren das molekulare Klonen und die Expression eines synthetischen Gens in einem heterologen 
Wirt umfaQt, wobei das Gen eine DNA-Sequenz umfaQt, die wie folgt dargestellt werden kann: 
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atgggggatcccatgatgcgcgacaactggcgctctgaactgtacaaata'caaagttgttiuaatcgiac 
cgctgggcatcgctccgaccaaagctaaacgccgcgttg™ 

TGGTATCCTGGGTGCTCTGTTTCTGGGTTTTCTGGGTKTGCTGGTT^ 

actctgactgttcaggctcgccagctgctgtctggtatcgttcagcagcawacaacctgctgcg^ 

TCAAGGATCCCAAAGCTCAGCAGCATCTGCTGCAACTGACTGTTTGGGGTATCAAACAACTK 

CGTTCTGGCTGTTGAACGCTACCTGAAAGACCAGCAGCTGCTGGGTATCTGGGGTTGCTCTGGTAAA 

ATTTGCACTACTGCCGTTCCGTGGAACGCTTCnGGTCCAACMATCTCT^ 

TGACTTGGATGCAATGGGAACGCGMATCAACMCTACACTAACCTGATCTACTCTCTG^ 

TCAGAACWGCAGGAAMAAACGAACAGGAACTGCTGCAACTGGACAAATGGGTCGACGCTTCTCTGTGG 

AACTGGTCTAACATAACTAAATGGCTGTGGTACATCAMCTGTTTATCATGATCGTTGGTGGTCTG^ 

GCCTGCGCATCGTTTTTGCTGTTCTGTCTATCGTTAACCGCGnCGC 

TCAGACTCGCCTGXCGMCCCGCGCGGTCCGGACCGCCCGG^ 

GACCGCGACCGCTCTACTCGCCTGGTAGATATCTCTCTGGCTCTGGTTTGGGAAGACCTGCGCTCTCTGT 
GCCTGT7TTCTTACCATCGCCTGCGCGACCTGCTGCTGATCGCTACTO 

CCGCGGTTGGGAAGTGCTGAAATACTGGTGGAACCTGCTGCAATACGTATCTCAGGAACTGAAAAACTCT 
GCTGTTTCTCTGGTTAATGCTACTGCTATCGCTGTTGCTGAAGGTACTGACCGCG^ATCGAAGTTGTTC 
AGCGCGCTTACCGCGCTATCCGCCATATCCATCGCCGCATCCGCCAGGGTCTGGAACGCATCCTGCTGCA 
GGTGCATGCCTCGAGTCTAGAAAGCTCATGGCAATTCGGGCCCGGGTAA , 

4. Verfahren nach Anspruch 3, wobei der heterologe Wirt E. Coli. ist. 

5. Verfahren nach Anspruch 3, wobei die DNA-Sequenz an eine zweite DNA-Sequenz fusioniert ist, die fur ein pro- 
karyontisches oder eukaryontisches Protein codiert, wobei ein Fusionsprotein exprimiert wird, das die Aminosau- 
resequenz umfaBt, die an ein prokaryontisches oder eukaryontisches Protein fusioniert ist. 

6. Verfahren nach Anspruch 5, wobei das prokaryontische oder eukaryontische Protein das E. Coli -Enzym CKS ist. 

7. Verfahren nach Anspruch 3, wobei die DNA-Sequenz an eine zweite DNA-Sequenz fusioniert ist, die fur ein HIV- 
gag-Protein codiert, wodurch ein Fusionsprotein exprimiert wird, das die Aminosauresequenz umfaGt, die an ein 
HIV-gag-Protein fusioniert ist. 

8. Verfahren nach Anspruch 7, wobei die zweite DNA-Sequenz fur ein H I V-1 -gag-Protein codiert, das eine Amino- 
sauresequenz umfaBt, die wie folgt dargestellt werden kann: 




iYKRWIILGLNKIVWYSPTSILDIROGPKEPFRDYVDRFYKTLRAEQASOEVKNWrtTETLLVQNAiNPDC 

KTILKALGPAATLEEMMTACQGVGGPGK^ 

TARNCRA. 



Revendications 

Revendications pour les Etats contractants suivants : AT, BE, CH, LI, DE, FR, GB, GR, IT, NL, SE 

1. Polypeptide comprenant une sequence d'aminoacides representee par la sequence suivante: 



MGDPMMRDNWRSELYKYKVVKIEPLGIAPTKAKRRVVQREKRADLAVGILGALFLGFLGAAGSTMGARSL 
TLTVQARQLLSGIVQQQNNLLRAIKDPKAOQHLLQLTVWGIKQLQARVLAVERYLICOQQLLGIWGCSGKL 
ICTTAVPWNASWSNKSLEOIWNNMTWMQWEREINNYTNLIYSLLEESQNQQEKNEOELLOLDKWVDASLW 
NWSNITKWLWYIKLFIMIVGGLAGLRIVFAVLSIVNRVRQGYSPLSFQTRLPNPRGPDRPEGIOEEGGER 
DRDRSTRLVDISLALVWEDLRSLCLFSYHRLRDLLLIATRIVELLGRRGWEVLKYWWNLLQYVSQELKNS 
AVSLVNATAIAVAEGTORVIEVVQRAYRAIRHIHRRIRQGLERILLQVKASSLESSWQFGPG. 



2. Polypeptide selon la revendication 1 , produit par E. coli. 
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3. Polypeptide de fusion comprenant un polypeptide selon la revendication 1 , dans lequel le polypeptide est f usionne 
avec une proteine eucaryote ou procaryote.. 

4. Polypeptide de fusion selon la revendication 3, dans lequel ladite proteine eucaryote ou procaryote est I'enzyme 
CKS d'E. coli. 

5. Gene synthetique comprenant une sequence d'ADN representee par la sequence suivante: 

A7GGGGGA7CCCA7GA7GCGCGACAAC7GGCGC7C7GAAC7G7ACAAA7ACAAAG77G77AAAA7CGAAC 
CGC7GGGCATCGC7CCGACCAAAGC7AAACGCCGCG77G77CAGCGCGAAAAACGCGCAGA7C7AGC7G7 
TGGTATCCTGGGTGCTCTGTTTCTGGG7T7TCTGGGTGCTGCTGGTTCTACTATGGG7GCTCGCTCTCTG 
ACTC7GAC7G77CAGGCTCGCCAGC7GC7G7C7GG7A7CG77CAGCAGCAGAACAACC7GC7GCGCGC7A 
7CAAGGA7CCCAAAGG7CAGCAGCA7C7GC7GCAAC7GAC7G777GGGG7A7CAAACAAC7GCAGGC7CG 
CG77C7GGCTG77GAACGC7ACC7GAAAGACCAGCAGC7GC7GGG7ATC7GGGG77GCTC7GG7AAAC7G 
AT77GCAC7AC7GCCG77CCG7GGAACGC77C77GG7CCAACAAA7CTC7GGAAGACATC7GGAACAACA 
7GAC77GGATGCAA7GGGAACGCGAAA7CAACAAC7ACAC7AACC7GATC7AC7C7C7GC7GGAAGAA7C 
7CAGAACCAGCAGGAAAAAAACGAACAGGAACTGC7GCAAC7GGACAAA7GGG7CGACGC77C7C7G7GG 
AAC7GG7C7AACA7AAC7AAA7GGC7G7GG7ACA7CAAAC7G777A7CA7GATCG77GG7GG7C7GGCCG 
GCC7GCGCA7CG77777GC7G77C7G7CTA7CG77AACCGCG77CGCCAGGG77AC7C7CCGC7G7C777 
TCAGAC7CGCC7GCCGAACCCGCGCGG7CCGGACCGCCCGGAAGG7ATCGA7GAAGAAGG7GG7GAACGC 
GACCGCGACCGC7C7AC7CGCC7GG7AGA7A7C7C7C7GGC7C7GG777GGGAAGACC7GCGC7C7C7G7 
GCCTG7777C77ACCA7CGCC7GCGCGACC7GC7GC7GA7CGC7AC7CGCA7CG77GAAC7GC7GGG7CG 
CCGCGG77GGGAAG7GC7GAAA7AC7GGTGGAACC7GC7GCAATACGTA7C7CAGGAAC7GAAAAAC7C7 
GCTG777C7C7GG77AA7GC7AC7GC7ATCGCTG77GC7GAAGG7AC7GACCGCG77A7CGAAG77G77C 
AGCGCGC7TACCGCGCTATCCGCCATA7CCATCGCCGCA7CCGCCAGGG7C7GGAACGCATCCTGC7GCA 
GGTGCATGCC7CGAG7C7AGAAAGC7CATGGCAA77CGGGCCCGGG7AA . 



6. Gene synthetique selon la revendication 5, codant pour le polypeptide de la revendication 1 . 

7. Polypeptide de fusion comprenant un polypeptide selon la revendication 1 , dans lequel le polypeptide est fusionne 
avec une proline gag de HI V. 

8. Polypeptide de fusion selon la revendication 7, dans lequel ladite proteine gag de HIV est une proteine gag de 
HIV-1 comprenant une sequence d'aminoacides representee par la sequence suivante: 



D7GHSS0VS0NYP!V0HIQGGMVK0AIS?R7LNAWVKVVE£:<ArS?Eyi?MFSALSEGA7P00LN7iMLN7 
VGGHOAAMOMLK E7 1 N E EAA EV DRV H ? V KAG? I APGOMRE? RG SO I AG77S 7L0EQ I GWM7NN PP I P VG E 
I Y XRW 1 1 LG Lii X I V RM Y S P 7 S I LD I RGG ? KE ? F RD Y V D R F Y K7 L RA EQ A S 0 EV KNWM7 E7 L L 70 MA N P 0 C 
K7 1 L KA IG ? AA 7 L E EMM7 AC QG V GG r GH KARV LA EAM S Q V 7N 7 A 7 1 MMQ RG N FRNQR KM V KC r M CGX EG H 



9. Procede de detection d'anticorps diriges contre des antigenes de HIV chez un individu, qui comprend les 6tapes 
selon lesquelles: 

a) on preleve un 6chantillon d'un liquide corporel de I'individu; 

b) on met ledit liquide corporel a incuber avec ledit polypeptide de la revendication 1 ; 

c) on met ledit liquide corporel a incuber avec un anticorps anti-immunoglobuline marque; et 

d) on detecte ledit marqueur et on en deduit la presence ou I'absence d'anticorps dirig6s contre les antigenes 
deHIV 

10. Procede de detection d'anticorps diriges contre des antigenes de HIV chez un individu, qui comprend les etapes 
selon lesquelles: 

a) on preleve un echantillon d'un liquide corporel de I'individu; 

b) on met ledit liquide corporel a incuber avec ledit polypeptide de la revendication 1 ; 

c) on met ledit liquide corporel a incuber avec un antigene marque; et 

d) on detecte ledit marqueur et on en deduit la presence ou I'absence d'anticorps diriges contre les antigenes 
de HIV. 
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Revendications pour I'Etat contractant suivant : ES 



1. Proced6 de detection d'anticorps diriges contre des antigenes de HIV chez un individu, qui comprend les etapes 
selon lesquelles: 



a) on preleve un Echantillon d'un liquide corporel de I'individu; 

b) on met ledit liquide corporel a incuber avec un polypeptide comprenant une sequence d'aminoacides re- 
presentee par la sequence suivante: 



MGOPMMRDNWRSELYKYKVVKIEPLGiAPTKAKRRVVQREKRAOLAVGILGALFLGFLGAAGSTMGARSL 
TLTVQARQLLSGIVQQQNNLLRAIKDPKAQQHLL0LTVWGIKQL0ARVLAVERYLKOQQLLGIWGCSGKL 
ICTTAVPWNASWSNKSLEDIWNNMTWMOWEREINNYTNLIYSLLEES0NQQEKNE0ELL0LDKWVOASLW 
NWSNITKWLWYIKLFIMIVGGLAGLRIVFAVLSIVNRVRQGYSPLSFQTRLPNPRGPORPEGIDlEGGER 
DRORSTRLVOISLALVWEDLRSLCLFSYHRLROLLLIATRIVELLGRRGWEVLXYWWNLLQYVSQELKNS 
'AVSLVNATAIAVAEGTDRVIEVVQRAYRAIRHIHRRIRQGLERILLQVHASSLESSWQFGPG. 



c) on met ledit liquide corporel a incuber avec un anticorps anti-immunoglobuline marque; et 

d) on detecte ledit marqueur et on en deduit la presence ou Pabsence d'anticorps diriges contre les antigenes 
de HIV. 

2. Precede de detection d'anticorps diriges contre des antigenes de HIV chez un individu, qui comprend les etapes 
selon lesquelles: 

a) on preleve un echantillon d'un liquide corporel de I'individu; 

b) on met ledit liquide corporel a incuber avec un polypeptide comprenant une sequence d'aminoacides re- 
presentee par ia sequence suivante: 



c) on met ledit liquide corporel a incuber avec un antigene marque; et 

d) on detecte ledit marqueur et on en deduit la presence ou I'absence d'anticorps diriges contre les antigenes 
de HIV. 

3. Proced6 de production d'un polypeptide comprenant une sequence d'aminoacides representee par la sequence 
suivante: 



MGDPMMRDNWRSEIYKYKYVKIEPLGIAPTKAKRRVVQREKRAD 
ICTTAVPWNASWSN^ 

NU^NTTKWL WYT<i ^TMIVGGLAGLRI VF AVLS I VNRVRQGYSPLSFQTRLPNPRGPORPE^iDE^GGcR 

DRORSTRLVOISLU 
AVSLVNATAIAVAEGTDRVIE 



ledit precede comprenant le clonage moleculaire et I'expression dans un hdte heterologue d'un gene synthetique 
comprenant une sequence d'ADN representee par la sequence suivante: 
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20 



30 




TGACTTgStc^ 
TC'GAACC^ 

io I^CTGGTCTAACATAACTAAATG^ 
GrrTGCGCATCGTTTTTC 
Tf "GACTCGCCTGCCGAACCC^ 
^CCGCGACCGCTCTACTC^ 
rcCTGTTTTCTTACCATCGC^ 
rr^CGGTTGGGAAGTGCTGAAA 
rrTPTTTCTC'TGGTTAATGC 

15 ^GCGCGCTTACCGCGCTAK 

G^TGCATGCCTCGAGTCTAGMAGC 



4. ProcedS selon la revendication 3, dans lequel I'hote heterologue est £ coll 

5. Precede selon la revendication 3, dans lequel ladite sequence d'ADN est fusionn£e avec une seconde sequence 
d'ADN codant pour une proteine procaryote ou eucaryote, grace a quoi est exprimee une proteine de fusion com- 
prenant ladite sequence d'aminoacides fusionn6e avec une proteine procaryote ou eucaryote. 

^5 6. Procede selon la revendication 5, dans lequel ladite proteine procaryote ou eucaryote est I'enzyme CKS d'E cofi. 

7. ProcSde selon la revendication 3, dans lequel ladite sequence d'ADN est fusionnee avec une seconde sequence 
d'ADN codant pour une proteine gag de HIV, grace a quoi est exprimee une proteine de fusion comprenant ladite 
sequence d'aminoacides fusionnee avec une proline gag de HIV 



Proced6 selon la revendication 7, dans lequel ladite seconde sequence d'ADN code pour une proteine gag de 
HIV-1 comprenant une sequence d'aminoacides representee par la sequence suivante: 



dtghssovsonypivo.niqgomvhoalsprtlnawvkvveekafspevifmfsalscgatfgolntmlnt 
vgghoaamomlketineeaaewdrvh?vhag?iafgomre?rgsoiagttstlqeoigwmthnppi?vge 
i y k rw 1 1 lg ln k i v rm y s f t 5 1 ld i rog p k e p f ro y v d r fy k7 l-.a eqas q c v knwht et i l vq n an fcc 
ktilkalgpaatleemmtacqgvggfgh:<arvlaeamsqvtntatimmqrgnfrnqr:oiv:<cf.ncgk 

TARNCRA 



45 



50 



55 
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Clustered order of selected sequences: 



2. CDC42 FRAG. PEP 

3. BH102FRAG.PEP 

4. SF2FRAG.PEP 
1. MALFRAG.PEP 

5. SYNFRAG.PEP 



1-107] 
1-107 
1-107 
^1-1071 
(1-107) 



1 kAQQHLLQL' 

. mm 



VWSIKQLOARILAVERVLl!999U?fW?Wf n WW?'f? l f l f t V ( ' 1 



Vfti 

u 




WNNMT 



WNNM" 
idNH" 
lilnNH' 



KAQQHLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKLICTTAVPWNASWSNKSLEDIWNNMT 



dNY 

I 

drr 
si" 



hL 
1st 



nt 



lytLIEESQNQQEKNqQELLqLDKW 

iiiaii 



69 WMEWDRE 

: in 

69 WMQWEREINNYTNLIYSLLEESQNQQEKNEQELLQLDKW 



FIGURE 1 



17 



EP 0 370 458 B1 

ASSEMBLY AND CLONING OF BS2-10 
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Sn>a- 

EcoRl Aval BamHI 



1 GiATTCGAGCTCGnTAC^cicG^ATCCCATGatacgcQacaact^gcgctctgaactgiacaaatac^a 69 
AsnSerSerSerVal ProGJyAspProMUHEIArgAspAsnTrpArgSerGluLeufyrLyslyrLy 



2 18 23 

20 

C-tem gpl?0 



70 agttgttaaaatcgaaccgctgggcatcgctccgaccaaagctaaacgccgcgttgttcagcgcgaaaa 138 
sValValLyslleGluProLeubfylleAlaProrhrLysAlaLysArgArgValValGlnArgGluLy 



^ Bg UI 

139 acgcgcaGATCTAgctgttggtatcetgggtgctctgtttctoggttttctgggtgctgctggUcta^ 207 
sArgAlaAspLeuAlaValGlyneleuSW^ 



415 



BamHI 

277 



gaacaacctgctgcgcgctatcAAGiATcccaaagctcagcagcatctgctgcaactgactatttggga 345 
nAsnAsnLeuLeuArgAlanelysAsgProLysAlaGlnGlnHfsLeuLeuGlnLeuThrVaUrpGl 



BS2-10 



346 ^f^^^fS£l2^2X?a?|^^^X^Ug?SS?Sl^£^S«^fSS?S£42E^22?^fJ 4M 

415 ctggggttgctctggtaaactgatttgcactactgccgttccgtggaacgcttcttggtccaacaaatc 483 
eTrpGlyCysSerGlyLysLeuIleCysIhrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSe 

484 tctggaagacatctggaacaacatgacttggatgcaatgggaacgcgaaa 
rleuGluAsplleTrpAsnAsnHETThrrrpHEfGlnTrpGluArgGluI 



aaatcaacaactacactaacct 552 
" leAsnAsnfyrThrAsnLe 



553 !«Si;3SK^ 621 

?•» AM __ ^ 

622 caaatg^t^ACgcttctctgtggaactggtctaacataactaaatggctgtggtacatcaaactgtt 690 W 

pLysT?pValAspAlaScrleufrpA$n1rpSerA$nHeThrLysTrplcuTrpTyrneLysLeuPh §5 

630 g3 

Hpal u_ 



6,1 ttisiiiuisfflssttaafa^^^ 759 

752 



413-2 



760 tcgccagggttactctccgctgtcttttcagactcgcctgccgaacccgcgcgotccggaccgcccgoa 828 
lArgGlnGlyTyrSerProleuSerPheGlnThrArgLeuProAsnProArgGlyProAspArgProGI 



coRV 



829 aggtatcgatgaagaaggtggtgaacgcgaccgcgaccgctctactcgcctggtagatatetctctggc 897 
uGIyneAspGluGluGIyGTyi^ 

887 

, . 

898 tctggtttgggaagacctgcgctctctgtgcctgttttcttaccatcgcctgcgcgacctgctgctgat 966 
ateuVal TrpGl uAspLeyArgSerLeuCysLcuPheSerTyrHI sArgLeuArgAspLeuLeuLeuI 1 

967 ^fss^^KUfs^w^Ke?^:!^:};^ 



tactggtggaacctgct 1035 
* 'rpTrpAsnLeule 



naBt 



1036 gcaatacgtatctcaggaactgaaaaactctgctgtttctctggttaatgctactgctatcgctgttgc 1104 
uGlnTyrValSei^lnGluLeuLysAsnSerAUValSerLeuValAsnAlaThrAlalleAlaValAI 



104 



1105 tgaaggtactgaccgcgttatcgaagttgttcagcgcgcttaccgcgctatccgccatatccatcgccg 1173 
aGluGlyrhrAspAfgValMeSluVal^ 



Aval HIndlH 
tCGAGTCTAGAAAGCTT 1233 



1217 1229 
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Amino Alphabet = Identity 

Output line length = 80 

Compress = Off 

Randomization = Off 

AMINO-Res-length = 2 

DEletion-weight = 1.00 

LEngth-factor = 0 

Matching-weight ■ 1.00 

NUCLEIC-Res-length - 4 

SPread-factor = 50 

Clustered order of selected sequences: 



9. MAL 

10. ELI 

13. Z6 

4. C0C42 
12. RF 

11. WMJ22 

7. BH8 

8. PV22 

2. BRU 
1. HXB2 
6. BH102 

14. HX83 

3. SF2 

5. SYNGENE 



1-384 
1-383 
1-383 
1-384 
1-384 
1-384 
1-383 
1-383 
1-383 
1-383 
1-383 
(1-383 
* 1-384 
1-413 



FIGURE 4-1 



Region Alignment: (listed in Clustered order) 
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FIGURE H-2 
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FIGURE 4-3 
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FIGURE W 
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PSP301.PEP 

10 20 30 40 50 60 70 

MGDPMMRONW RSELYKYKVV KIEPLGIAPT KAKRRVVQRE KRADLAVGIL GALFLGFLGA AGSTMGARSL 

linkerh HIV-1 env seq— *■ ^ m 130 140 

TLTVQARQLL SGIVQQQNNL LRAIKDPKAQ QHLLQLTVWG IKQLQARVLA VERYLKDQQL LGIWGCSGKL 

icq i 6 o +++ 170 180 190 " 200 210 

ICTTAVPWNA SWSNKSLEDI WNNMTWMQWE REINNYTNLI YSLLEESQNQ QEKNEQELLQ LDKWVDASLW 

2?n 230 240 250 260 270 280 

NWSNITKWLW YIKLFIMIVG GLAGLRIVFA VLSIVNRVRQ GYSPLSFQTR LPNPRGPDRP EGIDEEGGER 

?Qft 300 * 310 320 330 340 350 

DRDRSTRLVD ISLALVWEDL RSLCLFSYHR LRDLLLIATR IVELLGRRGW EVLKYWWNLL QYVSQELKNS 

•»60 * 370 380 390 400 410 

AVSLVNATAI AVAEGTDRVI EVVQRAYRAI RHIHRRIRQG LERILLQVHA SSLESSWQFG PG. 

f- linker seq 

PSD302.PEP 

in 20 30 40 50 60 70 

MTMITPSLAA GPDTGHSSQV SQNYPIVQNI QGQHVHQAIS PRTLNAWVKV VEEKAFSPEV IPMFSALSEG 



linker seq rHIV-1 ma seq->- lQQ m m UQ 140 

ATPQDLNTML NTVGGHQAAM QMLKETINEE AAEWDRVHPV HAGPIAPGQM REPRGSOIAG TTSTLQEQIG 

icn 160 170 180. 190 200 210 

WMTNNPPIPV GEIYKRWIIL GLNKIVRMYS PTSILDIRQG PKEPFRDYVD RFYKTLRAEQ ASQEVKNWMT 

??n 230 240 250 260 270 280 

ETLLVQNANP DCKTILKALG PAATLEEMMT ACQGVGGPGH KARVLAEAHS QVTNTATIMM QRGNFRNQRK 

?on 300 310 320 330 340 350 

HVKCFNCGKE GHTARNCRAP GDPMHRDNWR SELYKYKVVK IEPLGIAPTK AKRRVVQREK RADUVGILG 

360 37 l o 11nker, " HIV 38 1 0 enVSeq ~90 400 410 420 

ALFLGFLGAA GSTMGARSLT LTVQARQLLS GIVQQQNNLL RAIKDPKAQQ HLLQLTVWGI KQLQARVLAV 

A->n Aid 450 460 470 480 490 

ERYLKDQQLL GIWGCSGKLI CTTAVPWNAS WSNKSLEDIW NNMTWMQWER EINNYTNLIY SLLEESQNQQ 

cnn 510 520 530 540 550 .560 

EKNEQELLQL DKWVDASLWN WSNITKWLWY IKLFIMIVGG LAGLRIVFAV LSIVNRVRQG YSPLSFQTRL 

570 ** 580 590 ..600 .* 610 ,620 unti . B JM 



PNPRGPDRPE GIDEEGGERD RDRSTRLVDI SLALVWEDLR SLCLFSYHRL RDLLLIATRI VELLGRRGWE 

fian * 650 660 670 680 690 700 

VLKYWWNLLQ YVSQELKNSA VSLVNATAIA VAEGTDRVIE VVQRAYRAIR HIHRRIRQGL ERILLCjVHAS^ 

RVIN. 
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